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CANDIDA KEFYR CYTOSINE DEAMINASE 
FIELD OF THE INVENTION 

The invention relates to a cytosine deaminase {CD) protein and cDNA from the 
yeast Candida kefyr, variants of the same and uses thereof. 
5 BACKGROUND OF THE INVENTION 

Cytosine deaminase (CD) is an enzyme that converts cytosine to uracil. The 
bacterial and fungal versions of this enzyme can also convert 5-fluorocytosine (5FC) to 
5-fluorouracil (5FU). However, the human and mouse enzyme does not recognize 5FC 
as a substrate. Bacterial and fungal CD converts 5FC to 5FU, which is then converted to 
10 5-fluoro-deoxyuridine monophosphate (SFdUMP) in all species. 5FdUMP is an 
irreversible inhibitor of thymidylate synthase, and the accumulation of SFdUMP leads to 
cell death by inhibiting DNA synthesis via deoxythymidine triphosphate (dTTP) 
deprivation. 

Because the human CD gene does not convert 5FC to 5FU, the pro-drug 5FC is 

15 only toxic in those human cells that are engineered to express a bacterial or fungal CD 
gene. This has been used to advantage in treating tumors, and is an example of a 
"suicide gene" system. The tumors are transformed with a bacterial or fungal CD gene, 
usually by direct injection, implantation or systemic administration of a vector 
containing the CD gene. The patient is then treated with 5FC and the toxic effects of 

20 5FU lead to death of the transformed cells that continue to divide. 

The suicide gene system has been studied extensively as an approach to treat 
malignant tumors. One of the advantages of the system is that incorporation of the 
suicide gene into every tumor cell is not necessary for effective therapy; complete tumor 
responses have been reported in animals when less than 20% of the ceils expressed the 

25 suicide gene. This phenomenon is known as the "bystander effect," and is based on the 
continued toxicity of the drug to neighboring cells when a particular cell -dies and 
releases its drug load (6). 

The suicide gene system requires accurate targeting because gene expression in a 
normal cell followed by exposure to the pro-drug will kill the cell when it attempts to 

30 divide. This problem has been addressed by placing the suicide gene under control of a 
tissue-specific (or preferentially a tumor-specific) promoter -so that the gene will be 
expressed only in a select population of targeted cells. The alpha-fetoprotein promoter, 
which is preferentially activated in hepatoma cells, is an example of «uch an approach 
(8). Because many promoter sequences are not completely tumor specific, the suicide 
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gene may be also be expressed in some amount of healthy tissue. This is not fatal to 
efficacy, however, because like most chemotherapy, the premise of the treatment is that 
actively dividing cells are preferentially targeted by the drug. 

Although suicide genes are a promising approach for the specific targeting of 
5 tumors, there is room for improvement in most aspects of the system. In particular, an 
enzyme with increased activity would allow the use of lower doses of 5FC, and avoid the 
reported immunosuppressive effects of high 5FC doses. The present invention provides 
one such improvement. 

SUMMARY OF THE INVENTION 

10 The term "fusion" is used to refer to chemically linked polypeptides (or nucleic 

acid encoding such polypeptide), to another peptide (or nucleic acid encoding same) with 
a known property which can be utilized to impart the known property on the entire fusion 
protein. The use of fusions is common in the art to facilitate protein purification and for 
visualizing the protein of interest. An example of a protein fusion is the expression of 
15 proteins from vector where the protein is operably linked to an intein <a self cleaving 
protein) which is operably linked to a binding domain. By affixing the substrate of the 
binding domain to a solid surface, the protein of interest can be bound to the surface, 
rinsed, and released under conditions which induce intein cleavage. Other examples of 
fusions include the use of antigenic tags (such as HIS or FLAG) that can be used to 

20 isolate or visualize the tagged protein. 

The term "humanized," as used herein, refers to protein coding sequences in 
which the codons have been converted to codons utilized more frequently in a human 
gene, while still retaining the original amino acid sequence. Similarly, "£. coii bias" 
refers to a gene optimized for expression in E. coli. 

25 The term "isolated," as used herein, refers to a nucleic acid or polypeptide 

removed from its native environment. An example of an isolated protein is a protein 
bound by a polyclonal antibody, rinsed to remove cellular debris, and utilized without 
further processing. Salt-cut protein preparations, size fractionated preparation, affinity- 
absorbed preparations, recombinant genes, recombinant protein, .cell extracts from host 

30 cells that expressed the recombinant nucleic acid, media into which the recombinant 
protein has been secreted, and the like are also included. The term "isolated" is used 
because, for example, a protein bound to a solid support via another protein is at most 
50% pure, yet isolated proteins are commonly and reliably used in the art. 
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"Purified," as used herein refers to nucleic acids or polypeptides separated from 
their natural environment so that they are at least 95% of total nucleic acid or 
polypeptide in a given sample. Protein purity is assessed herein by SD5-PAGE and 
silver staining. Nucleic acid purity is assessed by agarose gel and EtBr staining. 
5 The term "substantially purified," as used herein, refers to nucleic acid or protein 

sequences that are removed from their natural environment and are at least 75% pure. 
Preferably, at least 80, 85, or 90% purity is attained. 

The phrases "nucleic acid" or "nucleic acid sequence," as used herein, refers to 
polynucleotides, which may be gDNA, cDNA or RNA and which may be single- 
10 stranded or double-stranded. The term also includes peptide nucleic acid (PNA), or to 
any chemically DNA-like or RNA-like material. "cDNA" refers to copy DNA made 
from mRNA that is naturally occurring in a cell. "gDNA" refers to genomic DNA. 
Combinations of the same are also possible <i.e., a recombinant nucleic acid that is part 
gDNA and part cDNA). 

15 "Fragments" refers to those polypeptides (or nucleic acid sequences encoding 

such polypeptides) retaining antigenicity, a structural domain, or an enzymatic activity of 
the full-length protein. The "enzymatic activity" of the CD protein is herein defined to 
be the conversion of 5FC to 5FU. "Structural domains" including the conserved cytosine 
deaminase domain (residues 3-104) are as indicated in Fig. 5. 

20 The term "oligonucleotide," as used herein, refers to a nucleic acid sequence of at 

least about 15 nucleotides to 100 nucleotides, and all integers between. Preferably, 
oligonucleotides are about 18 to 30 nucleotides, and most preferably about 20 to 25 
nucleotides. Generally, an oligonucleotide must be greater than 22 to 25 nucleotides 
long for specificity, although shorter oligonucleotides will suffice in certain applications. 

25 The terms "operably associated" or "operably linked," as used herein, refer to 

functionally coupled nucleic acid sequences. 

A "variant" of CD polypeptides, as used herein, refers to an amino acid sequence 
that is altered by one or more amino acid residues. Such variations may be naturally 
occurring or synthetically prepared. Common variants include "conservative" changes, 

30 truncations, and domain removal or swapping with similar proteins. Guidance in 
determining which amino acid residues may be substituted, insetted, or deleted without 
abolishing biological or immunological activity may be found using computer programs 
well known in the art, for example, LASERGENE™ software, and comparison against 
the many known CD genes. 
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The term "naturally occurring variant," includes those protein or nucleic acid 
alleles that are naturally found in the population in question. The naturally occurring 
allelic variants may be point, splice, or other types of naturally occurring variations. 

"High Stringency" refers to wash conditions of 0.2X SSC, 0.1% SDS at 6S°C. 
5 "Medium stringency" refers to wash conditions of 0.2XSSC 0.1% SDS at 55°C. 

In calculating "% identity" the unaligned terminal portions of the query sequence 
are not included in the calculation. The identity is calculated over the entire length of the 
reference sequence, thus short local alignments with a query sequence are not relevant 
(e.g., % identity = number of aligned residues in the query sequence/length of reference 
10 sequence). Alignments are performed using BLAST homology alignment as described 
by Tatusova TA & Madden TL (1999) FEMS Microbiol. Lett. 174:247-250. The default 
parameters were used, except the filters are turned OFF. As of Jan. 1, 2001 the default 
parameters were as follows: BLASTN or BLASTP as appropriate; Matrix = none for 
BLASTN, BLOSUM62 for BLASTP; G Cost to open gap default = 5 for nucleotides, 1 1 
15 for proteins; E Cost to extend gap (Integer] default = 2 for nucleotides, 1 for proteins; q 
Penalty for nucleotide mismatch £IntegerJ default = -3; r reward for nucleotide match 
[Integer] default = 1; e expect value [Real] default = 10; W wordsize (Integer] default = 
1 1 for nucleotides, 3 for proteins; y Dropoff (X) for blast extensions in bits (default if 
zero) default = 20 for blastn, 7 for other programs; X dropoff value for gapped alignment 
20 (in bits) 30 for blastn, 15 for other programs; Z final X dropoff value for gapped 
alignment (in bits) 50 for blastn, 25 for other programs. This program is available online 
at http://www.ncbi.nlm.nih.gov/BLAST/ 

Candida kejyr (a.k.a. Candida pseudotropicalis, Kluyveromyces marxianus, 
Kluyveromyces fragilis) CD is an improvement over the several prior art cytosine 
25 deaminase proteins. C. kejyr CD has significantly lower expression levels than E. coii 
CD (to date), but its activity is much higher in converting 5FC to 5FU. Further, 5FC is a 
better substrate for C. kejyr CD than is the natural substrate, cytosine. Table 1 provides a 
listing of sequences taught herein. 



TABLE 1 - SEQ ID NO AND DESCRIPTION 



SEQID 
NO: 


TYPE 


Length 


Name and Description 


1 


cDNA 


456 nt 


Wild type Candida kefyr CD cDNA 


2 


Peptide ; 


152 aa 


Wild type Candida kefyr CD protein 


3 


cDNA 


456 nt 


Variant Candida kejyr CD cDNA with 74C-+T, 99T-*C, 159T-*A, 
243T-*C, 309C-+T, 336A-*T3, 365 A-*G 


4 


Peptide 


152 aa 


Variant Candida kefyr CD protein D33E 


5 


Peptide ; 


158 aa 


Wild type Saccharomyces cerevisiae CD protein 
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459 nt - 
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10 


cDNA 


1104 nt : 


Cjitirlidn kefur CD-uracil nhosnhnribosvltransferase/PURl^ fusion 
nrntpin fDNA 


11 




IS 


Probe 1 


12 


Oligo 


18 


Probe 2 


13 


OUpo 

WllgU 


18 




14 




15 


Probe 4 


15 




15 


Probe 5 

1 1 DUG J 


16 




30 


PK" V I nnp 

V-» JV J l_<UJ Ig 


1 7 


VSIlgU 


30 




18 


OligO 


33 


CK5*Nest 


19 


Oligo 


30 


CK 3* Nest 


20 


Oligo 


39 


CK 5' BamHl 


21 


Oligo 


32 


CK3'Xhol(nostop) 


22 


Oligo 


33 


CK3 f Pstl (stop) 


23 


Peptide 


16 


C. kefyr CD Epitope 1 


24 


Peptide 


18 


C kefyr CD Epitope 2 | 



Other protein variants are described in Table 2 with reference to SEQ ID NO: 2. 
TABLE 2 - CD VARIANTS 



SEQ ID NO:2 Variant 


%H> 


Activity 


D33E-3TLAG 


151/152 
(99%) 


Mutant tested and has less activity than the protein of SEQ ID 
NO: 2 


D33E 


151/152 
(99%) 


Mutant not yet made, but may have decreased activity 


192L/L93I/I97L 


149/152 

<98%) ; 


Expected to have 800 fold less than wild-type because this mutant 
is shown to be less active in other yeast species. See 
WO199960008. 


T80S/TS1S/T89S/Y95S 


148/152 
(97%) 


Expected to have 800 fold less than wild-type, see above. 


S42K/R47K 


150/152 
(99%) 


Expected to have 100 fold less than wild-type, see above. 


K129R/K136R 


150/152 ; 
(99%) 


Expected to have 800 fold less than wild-type, see above. 


Al 


151/152 
<99%) 


Met-free variant, expected to have wild type activity. 


Al-2 


150/152 
(99%) 


Deleting first two amino acids which are missing in the consensus 
protein, expected to have activity. 


Al-8 


144/152 
(95%) ; 


Alternate start codon, expected to have activity. 


A 1-9 J 


143/152 
(94%) 


Met-free variant using alternate start codon, encodes the cytidine " 
and deoxycytidylate deaminase zinc-binding region, expected to 
have some activity. 


Single E to D mutations, ~ 
at positions 3, 38, 114, 138 


151/152 
(99%) 


Not expected to change activity since change conservative and 
residue not conserved in yeast. 


Single D to E mutation at ' 
position 33 


151/152 
(99%) 


Not expected to change activity since change conservative and 
residue not conserved in yeast. 


Single K to R mutation,^ 
esp. at positions 11, 69, 
124, 129, and 141 


151/152 '; 
(99%) 


Not expected to change activity since change conservative and 
residue not conserved in yeast 
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The invention is generally directed to protein and gene or cDNA sequence of C. 
kefyr cytosine deaminase of amino acid sequence of SEQ ID NO: 2 or S EQ ID NO: 4. 
Variations of the cDNA encoding the protein are provided, including an E. coli biased 
CD cDNA (SEQ ID NO: 7), a humanized CD cDNA(SEQ ID NO: 8), and a humanized 
5 and CpG-free CD cDNA (SEQ ID NO: 9). Fusions are also provided, in particular the 
CD-uracil phosphoribosyltransferase fusion {SEQ ID NO: 10) and a CD-FLAG fusion 
(see table 2). 

The nucleic acid sequences can be used in traditional suicide gene therapy 
methodologies. Suicide gene therapies are in phase I, II and m clinical trials, and are 
10 well established treatment supplements or alternatives. The C. kefyr gene provides an 
advantage over current suicide gene sequences because lower amounts of 5FC are 
needed for therapy, due to the protein's improved ability to convert 5FC to 5FU. The 
gene also has uses in preparing large amounts of protein for biochemical 
characterization, preparation of antibodies, and the like. 
15 A large number of variant protein sequences are provided, based on both the 

known homologies with prior art sequences and on the predicted characteristics of the 
protein, as shown in tables 1-3 and Figs. 5 and 6. The range of mutants provided all are 
within 94% amino acid identity to the wild type sequence as disclosed. The closest prior 
art sequence has only 74% amino acid identity to the C. kefyr cytosine deaminase 
20 protein. 

Antigenic fragments of C. kefyr cytosine deaminase are also provided, which 
have already been used to successfully generate antibodies Of the invention. The 
antigenic fragments can be selected to be unique or conserved, as shown in table 5. 
Similarly, fragments of the nucleotide sequence of SEQ ID NO: 1 or 3 can be used as 
25 oligonucleotide probes or as primers in a variety of methods. Larger fragments can also 
be used as probes. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1. Candida kefyr Cytosine deaminase cDNA sequence {SEQ ID NO: 1). 
The cDNA was cloned by PCR amplification using primers from those portions of 
30 sequence that were conserved among the prior art fungal sequences. 

FIG. 2. Candida fe$>rCytosine deaminase amino acid sequence (SEQ ID NO: 2). 
The amino acid sequence was derived from the nucleotide sequence of the cDNA. 

FIG. 3. Variant Candida kefyr Cytosine deaminase cDNA sequence with 74C— >T, 
99T-+C, 159T-*A, 243T-+C, 309C— T, 336A->G, 365A-+G (SEQ ID NO: 3). The 
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variant CD protein was obtained from a different clone amplified using the previously 
described system. 

FIG. 4. Variant Candida kefyr Cytosine deaminase amino acid sequence D33E 
(SEQ ID NO: 4). The amino acid sequence was derived from the nucleotide sequence of 
5 the cDNA in Fig. 3. 

FIG. 5. CD Multiple Sequence Alignment with Candida kefyr -{SEQ ID NO: 2), 
S. cerevisiae (SEQ ID NO: 5), C albicans (SEQ ID NO: 6), and the CD consensus 
sequence. The 5. cerevisiae sequence was obtained from GenBank accession number 
NP_015387. The C. albicans sequence was obtained from GenBank accession number 
10 AAC157S2. The consensus sequence was obtained from the Pfam database of protein 
domains and HMMs (http://pfam.wustl.edu/index.html) accession number PF00383. 
The sequences in FASTA format were aligned according to Higgins (7), using the 
default parameters (http://www.ebi.ac.uk/clustalw). Default settings as of June 20, 2002 
were: CPU mode = clustalwjnp; alignment = full; output format = aln w/ numbers; 
1 5 output order = aligned; color alignment = no; KTUP = def; window length = def; score = 
percent; topdiag = def; pairgap = def; phylogenetie tree = none {off, off); matrix = def; 
gap open = def; end gaps = def; gap extension = def; gap distances = def; tree type = 
cladogram; tree gap distance = hide. 

FIG. 6. Pairwise alignment with & cerevisiae sequence. Sequences were aligned 
20 at http://www.ncbi.nlm.nih.gov/BLAST/ 

FIG. 7. Western Blot Using Anti-FLAG. An anti-FLAG antibody is used to 
confirm expression of the CD constructs, including CK-CD-FLAG and E-CD-FLAG. 
The bacterial protein is much more strongly expressed than the yeast protein. 

FIG. 8. Titration of CK-CD-FLA13 and E-CD-FLAG. 30 of CK-CD-FLAG 
25 Iysate is about equal to 3 p.! of E-CD-FLAG lysate when tested by Western blot with 
anti-FLAG antibody. This suggests that the CK-CD-FLAG protein is about ten-fold less 
well expressed than the E-CD-FLAG. Yet, control experiments <with a co-trans&cted 
GFP containing vector, not shown) show roughly equivalent transfection efficiencies. 

FIG. 9. Cytosine to Uracil Assay. Equal amounts of cell lysate are assayed for 
30 cytosine to uracil conversion at 37°C in a 16 hour assay. Activities of CK-CD-FLAG, 
CK-CD, E-CD-FLAG, and E-CD are shown, and it is apparent thatCK<:D without the 
FLAG tag is much more active than the protein with the FLAG tag. This may represent 
increased expression or activity or some combination thereof. 
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FIG. 10. 5FC to 5FU Assay at Different Temperatures. Equal amounts of cell 
lysate are assayed for cytosine to uracil conversion at 37°C in a 2 hour assay at the 
indicated temperatures. CK-CD is much more active than CK-CD-FLAG, and this may 
reflect better expression, activity or both. Both tagged and untagged yeast CD are more 
5 active against 5FC than the bacterial protein. 

DESCRIPTION OF EMBODIMENTS OF THE INVENTION 
The present invention is directed to the Candida kefyr cytosine deaminase {CK- 
CD) protein and cDNA sequence. Also included are i) E. coli biased and humanized 
DNA sequences encoding CD, ii) antigenic polypeptide fragments and antibodies to 
10 same, iii) variants predicted to retain functional activity based on comparison with the 
large number of proteins in this family and inactive mutants, iv) naturally occurring 
variants, v) fusion proteins, such as FLAG, GFP, luciferase, uracil 
phosphoribosyltransferase, and monoclonal antibody fusions. 

EXAMPLE 1: CLONING AND SEQUENCE CHARACTERIZATION OF CD 
15 Genomic DNA for Candida kefyr was purchased from ATCC. Amino acid 

sequence alignment of known fungal cytosine deaminase genes FCA1 i/C. albicans) and 
FCY1 (5. cerexnsiae) were used to design degenerative primers for regions of the genes 
that were homologous. The oligonucleotides were as follows: 



Primer 


Sequence 


Probe 1 = SEQ ID NO: 11 


ATG TGT ACT GGT OCT ATT ! 


Probe 2= SEQ ID NO: 12 


GGT GAA AAC GTT AAC TTC 


Probe 3 = SEQ ID NO: 13 \ 


GGT GAA AAC GTC AAC TTC 


Probe 4 = SEQ ID NO: 14 


GAA GAT ATT GGT GAA 


Probe 5 = SEQ ID NO: 15 


GAA GAC ATT GGT GAA 



Polymerase chain reaction was performed with the described primers andO.i jig 
20 genomic DNA under the following conditions: 

; lii1(0.1iig)DNA "~ 

"l^ldNTPmix(lOMM) ] 

5^1 IPX Pfu polymerase buffer 

lul Probe 1 (5* primer) (IQuM) 

llil Probe 2, 3, 4, or 5 (3* primer) (lOfiM) 

1 ul Pfu Turbo polymerase (4 units) (STRATAGENE™) 

40ul ddH2Q 

Reaction conditions were as follows: 94°C for 3 minutes <1 cycle), followed by 
25 cycles of 94°C 45 seconds, 40°C for 45 seconds, 72°C for 30 seconds. Ten pi of the 
reactions were run on a 3% agarose gel. Based on the sequence alignment of the known 
CD genes, expected fragment sizes were expected to be between 66 and 200 base paks, 
25 depending on primer set used in reaction, and the CK PCR produced tfie expected 
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fragments. These fragments were excised from the agarose gel and subcloned into 
pCRScript cloning vector (STRATAGENE™, La Jolla, CA). Plasmid DNA was 
purified and analyzed by restriction digest for clones containing correct insert size. The 
plasmid DNA was sequenced by ResGen Laboratories <INyiTROGEN TM , Carlsbad, 
5 CA). 

Based on the sequence of the clones described above, primers were designed and 
synthesized for use in a Genome Walker kit <CLONTECH™, Palo Alto, CA) to isolate 
the full length CD gene of C. kefyr. The oligonucleotides used with the kit were as 



follows: 



Primer 


Sequence * • 


CK 5' Long = SEQ ID NO: 16 


GGCCGTGTTGTCATTGGTGAAAACGTCAAC 


CK 3' Long = SEQ ID NO: 17 J 


G GTCTCTTTTC A ATG AACTCCTTC ATTAT A \ 


CK 5* Nest = SEQ ID NO: 18 


GTTGTCATTGGTGAAAACGTCAACTTCAAAAGC 


CK 3' Nest = SEQ ID NO: 19 ! 


AATGAACTCCTTCATTATATCGATACAGCG 



10 "Libraries" were created and PCR performed as described in the Genome Walker 

kit protocol. The PCR products were subcloned into pcDNA2. 1 ^NVITROGEN™), 
plasmid DNA purified and clones sent for sequencing to ResGen Laboratories. 
Sequences for the 5' and 3' ends of C. kefyr gene were determined and oligonucleotides 
containing restriction enzyme sites (italicized) and a mammalian Kozak sequence ^bold) 

15 were designed and synthesized as follows: 



Primer 


Sequence 


. CKS'BamHl 

< = SEQ ID NO: 20 


CTTGGGATCCGCCACCATGGCTGAATGGGATCAAAAGGG 


CK3'Xhol(nostop) 
= SEQ ID NO: 21 


CTTGCTCXjAGTTCGCCAATGTCTTCGTACCAG 


CK3' Pstl (Stop) 
- = SEQ ID NO: 22 


CITGCTGCAGCTATTCGCCAATGTCTTCGTACC 



The full length CD gene was isolated via PCR from C. kefyr genomic DNA under 
the following conditions: 

2\jl\ (0.2ug) genomic DNA ; 

lul dNTP mix (IOjaM) ; ; 

: 5nl IPX Pfu rxn buffer : 

5 1 primer (lOuM) 

4 3 f primer (lOuM) . 

* \\x\ Pfu Turbo DNA polymerase (4 units) , 

39ulddH2Q 

25 cycles at 94°C for 30 seconds, 65°C for 30 seconds and 72°C for 1 minute 
20 were run in a thermocycler. PCR products were analyzed by agarose gel with ethidium 
bromide staining. The fragments were digested with the appropriate TCStriction-enzymes 
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and subcloned by standard methods into pCMV4A vector <STRATAGENE™). The 
sequence of cytosine deaminase gene was confirmed by ResGen Laboratories. 

The clone containing the gene was deposited at ATCC PTA-4S67. The cloned 
insert was sequenced and is presented in Fig. 1, the translation product in Fig. 2. A 
5 naturally occurring allelic variant is shown in Figs. 3 and 4. The protein of SEQ ID NO: 
2 is 152 amino acids and is characterized as follows: 

TABLE 3: GENERAL CHARACTERISTICS 
Number of amino acids: 152 

Molecular weight: 16819.1 . 

Theoretical pi: 4.95 



Amino acid composition: 



Ala 6 
3.9 % 


Arg9 
3.9 % 


Asn 5 
3.3 % 


Asp 9 * 
5.9% . 


Cys5 
3.3% . 


Gin 2 
1.3% 


Glu 16 
10.5% 


GIy20 
13.2% 


His 3 
2.0% 


Hell 

7.2 % 


Leu 11 

7.2 % 


Lys 11 

7.2 % 


Met 6 
3.9 % 


Phe3 
2.0% ; 


Pro 5 

3.3 % 


Ser 7 : 
4.6% 


Thr9 

5.9% 


Tip 2 

1.3 % 


Tyr7 
4.6% 


Val 8 

5.3% 


AsxO 


GixO 


XaaO 

















Total number of negatively charged residues (Asp + Glu): 25 



Total number of positively charged residues (Arg + Lys): 17 . 

] Formula: C735H1156N196Q233S1 1 Total number of atoms: 2331 

, Estimated half-life: The N-terminal of the sequence considered is M (Met). The estimated half-life is: 30 hours 
(mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). >10 hours (Escherichia coli, in vivo). Instability 
index: The instability index (II) is computed to be 35.20. This classifies the protein as stable. Aliphatic index: 

- 75.66 

Grand average of hydropathicity (GRAVY): -0.4 1 0 - 

There are no predicted myristolation sites or n-terminal signal sequences. 

There is a potential cleavage site at amino acid 1 18. 

EXAMPLE 2: EXPRESSION OF CD 
Human embryonic kidney cells (HEK 293 cells) were transiently transfected with 
10 one of these constructs: CK-CD-FLAG, CK-CD, E-CD-FLAG, E-CD or vector alone 
(pCMV-tag4C). Control experiments were performed to confirm that each construct was 
transfected with comparable efficiency. Cells co-trans&cted with a construct and a GFP 
containing plasmid indicated approximately equivalent levels of transfection. 

The experimental details are as follows: 293 cells in 10 cm dishes were 
15 transfected with 10 or 15 ^g of CD constructs using FuGene 4 (ROCHE™) at a 
DNA/FuGene ratio of 1:3. No carrier DNA was used. -Cell lysates were collected in 1 ml 
of lysis buffer (20 raM Tris-Cl, pH8, 150 mM NaCl, 1% Triton X-100) after 48 hours by 
three freeze/thaw cycles using dry ice for 10 minutes and room temperature for 10 
minutes. 

20 30 ill of cell lysate was separated by 12% SDS-PAGE, transferred to membrane 

and then Western blotted with anti-flag monoclonal antibody at a -dilution of 1:5000 
(SIGMA™). The membranes were incubated with HRP conjugated goat-anti-mouse 
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secondary antibody (1:10000, AMERSHAM™) and the signal visualized with an ECL 
system (AMERSHAM™). The data in Fig. 7 shows that & coli CD-FLAG is expressed 
much more efficiently than Candida kefyr CD-FLAG, although they were constructed in 
the same vector Similar results were obtained <not shown) when the cells weie 
5 transfected with 2 fig of each DNA construct. Experiments are planned to test a 
humanized C. kefyr gene and it is expected that the humanized gene will improve the 
expression levels. 

Because the bacterial protein was more strongly expressed than the yeast protein, 
an attempt was made to compare the levels for the amount of expression of the two 

10 clones. In a second Western blot, 30 \il of CK-CD-FLAG was compared against 
increasing amounts of E-CD-FLAG. As seen in Fig. 8, 30 jil of CK-CD-FLAG (-18KD 
band) is roughly equivalent to 4 |il of E-CD-FLAG (-60 KD band), indicating about 7.5- 
fold better expression. In the following experiments, this ratio of cell lysates is used. 
The presence of a fainter band at about 38 KD may indicate that some portion of the CK- 

15 CD protein dimerizes. 

EXAMPLE 3: ENZYMATIC ASSAY 
For conversion assays, 1 p.1 of 100 jtCi/ml either !4 C-cytosine or ,4 C-5FC 
(MORAVEK BIOCHEMICALS™) was added to 45 yd of yeast CD or 6 yd of bacterial 
CD cell lysates and incubated at the indicated temperatures for 2 or 16 hours. Hie 
20 reaction mixtures were loaded onto a TLC plate (LK5DF SILICA GEL, Cat. No. 4856- 
821, WHATMAN™) and resolved with l-butanol:H 2 0 (85:15) for 3 hours. The plate 
was then dried and visualized by autoradiography. However, CD activity can also be 
assayed by 19F nuclear magnetic resonance (NMR) as described by De Vito (2) and 
Martino(17). 

25 Cell lysates were assayed for cytosine to uracil conversion at 37°C in a 16 hour 

assay. The activities of CK-CD-FLAG (wife the D33E variation), CK-CD, E-CD- 
FLAG, and E-CD are shown in Fig. 9. It is apparent that CK-CD without the FLAG tag 
is much more active than the protein with the tag (compare lanes 1 and 2). This may 
represent increased expression or increased activity or some combination thereof. This 

30 may also reflect the D33E mutation in the CD-FLAG variant. Because the CK-CD 
protein is very small, it is anticipated that the FLAG may inhibit its activity, although the 
prior figure also indicates that expression is also inhibited. 

The C. kefyr CD protein was found to be far less active against cytosine than was 
the E. coli CD protein (compare lanes 2 and 4). ITiis probably reflects the fact that the 
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assay conditions employed were optimized for the E. coli protein. Further, although 
bacterial activity seems to increase with increased reaction time^not shown), that of the 
yeast protein does not. This is confirmed in an independent assay (not shown), and may 
indicate that under these conditions the yeast enzyme is slightly less stable than the 
5 bacterial enzyme. 

The S. cerevisiae protein is also known to be thermally instable (measured T\n = 
1 hr, (16)). However, structural analysis indicates that both yeast proteins should be 
more stable in mammalian cells (calculated T\a >30 hours, see Tables 3-4). Thus, we 
expect that the stability can be improved by optimizing the reaction conditions and this 

10 work is underway. 

In the next experiment, we tested the activity of each enzyme against 5FC. The 
experiment was performed as described above, but using 5FC in place of cytosine. 
Further, this experiment was combined with a temperature optimization study and was 
performed at 4 different temperatures. As shown in FIG. 10, the yeast protein had a 

15 lower temperature optima than the bacterial protein, probably reflecting typical ambient 
growth conditions for each organism. However, the C kefyr protein was sufficiently 
active at 37°C for it to be useful in human therapies. Further, a variant with increased 
stability can be isolated by screening yeast grown at 37°C, as described in the Examples 
below. 

20 Although the CK-CD protein was less active against cytosine,. it is unexpectedly 

more active against 5FC than is the E. coli CD at 2 hours. This is shown in Fig. 10 
where the yeast protein shows much better conversion of 5FC to 5FU (compare lanes 1 
and 3 or 2 and 4). At 16 hours of reaction, the bacterial protein produces as much or 
more 5FU than the yeast protein (not shown), but as discussed above, this probably 

25 reflects the better stability of the bacterial protein under the -reaction conditions 
employed. 

The experiment demonstrates that yeast protein is more active than the bacterial 
protein against the pro-drug 5FC. Thus, the Candida kefyr CD gene and protein will 
provide an advantage in suicide gene therapy, because decreased dosage of 5FC can be 
30 employed while still achieving cytotoxicity. Experiments are underway to -confirm that 
the specific activity of the yeast protein against 5FC is significantly greater than that of 
the bacterial protein. 

Using the above assay, the enzyme was further characterized and compared 
against the existing cytosine deaminase proteins. The results are as follows: 
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TABLE 4: YEAST CD COMPARISON 



CD Species 


AM 
INO 
ACI 
D 


% identity ' 


Tl/2 
HR 


Preferred Buffer 


Vmax 

nM/min/ug 

enzyme 


Km 
mM 


Activity 

% 5FU/5FCby ■ 
0.2 jig CD in 2h 


Candida 
kejyr 


152 


100% 


>30 ' 


n/a 


n/a 


n/a 


-50% by 45 ul ; 
cell extract 


S. cerevisiae 
FCY1 


158 


74% 
131/151 


>30 
<"1) 


lo 100mMTris, 
pH7.8, 1 mM 
EDTA 


,6 68.0±12.0 


16 0.8±0.2 


!t> 77.9 ± 6.2 


C. albicans 
FCA1 


150 


58% 
86/148 


N/A 


N/A 


N/A 


N/A 


N/A 


E. coli 
CD 


427 


none 


N/A 




,6 11.7±3.S 


lo l7.9+-4.4 


l6 16.0±0.8 



EXAMPLE 4: CYTOTOXICITY ASSAY 
The radiosensitizing effect of 5FC and 5FU in HT29, HT29/bCD, and HT29/yCD 
cells is determined using a standard clonogenic assay (3). Cells are treated with 5FC or 
5 5FU at various concentrations for 24 h before irradiation at 37°C in media -containing 
10% dialyzed serum. The radiation survival data are corrected for plating efficiency 
using a nonirradiated plate treated with 5FC or 5FU under the same conditions. The 
surviving fraction is plotted against the radiation dose, and -curves fit using the linear- 
quadratic equation. The radiation sensitivity is expressed as the MED, which represents 

10 the area under the cell survival curve (I). Radiosensitization was expressed as the ER, 
which is defined as MIDcontroi/MIDireated. 

To determine the cytotoxic and radiosensitizing effect of 5FC and 5FU on 
bystander cells, cocultures of 90% bystander hygromycin-resistant HT29-cells and 10% 
puromycin-resistant CD-transduced HT29 cells are used. Cell survival of the 

15 hygromycin-resistant HT29 cells and puromycin-resistant CD-transduced HT29/cells is 
determined by plating the cells in selective media after treatment and assessed using a 
standard clonogenic assay as described above. 

Cytotoxicity assay - transfected cells were seeded at a density of 1 x IX) 3 
cells/well in a 96-well microtiterplate containing LOO p,l of culture medium. A -set of 

20 sterile stock dilutions of the 10 mg/ml 5FC solution were prepared. One day later, 
increasing concentrations of 5*FC was added to the wells, and a control well without the 
prodrug was included. After 5-7 days, the cells were washed with &esh medium and 
cytotoxicity was assessed by trypan blue exclusion, using a hemocytometer to quantify 
the results. The results are expected show increased cytotoxicity and bystander efiect 

25 (up to 10 fold) per unit dose of 5FC compared with the bacteria, due to the increased 
conversion rate. Similarly, increased activity is expected in the CD^FURl fusion 
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protein. It is not known how the C. kefyr protein will compare with the S. ceresnsiae 
protein, but it is known that this protein has a 22-fold lower Km and a 4-fold higher 
Vmax for 5FC than bacterial CD protein. Urns, the activities are expected to be roughly 
equivalent or perhaps somewhat better in C. kefyr, once the assay is optimized for C. 

5 kefyr. 

EXAMPLE 5: ANTIBODIES 

The peptidic fragments listed in the Table are synthesized and used to inject 
rabbits. Polyclonal antibodies are prepared therefrom and screened for activity. The best 
samples are chosen to prepare monoclonal antibodies. 
10 TABLE 5. ANTIGENIC FRAGMENTS OF CD 



Residues from 
SEQK)NO:2 


Marker 


Use 


42-56 
83-94 


Conserved residues 


Broad-range antigenic fragments allow monitoring of CD 
from a variety of species. 


32-43 
5S-79 
110-139 


C. kefyr specific 


Specific antigenic fragments allow monitoring of-C. kefyr 
CD only. 


1-16 




N-terminal detection (SEQ ID NO: 23) 


M-136-152 




C-terminal detection (SEQ ID NO: 24) 1 



To date, we have prepared a polyclonal antibody to the peptides from the amino 
and carboxyl terminals of the protein, according to standard techniques. The peptides 
were Met-Ala-Glu-Trp-Asp<jln-Lys-Gly-M (SEQ ID 

NO: 23) and Cys-Lys-Glu-Phe-Ile-Glu-Lys 
15 Glu (SEQ ID NO: 24). Work is planned to isolate monoclonal antibodies for each. 

EXAMPLE 6: VARIANTS 
The specific variants listed in table 2 are synthesized by site specific mutagenesis 
of SEQ ID NO: 1 or SEQ ID NO: 3. Additional variants may be made, however, it is 
suggested that the conserved residues indicated by the black boxes in Fig. 5 or 6 not be 
20 changed, unless one desires an inactive mutant. Further, changes within the cytidine and 
deoxycytidylate deaminase zinc-binding region <grey box) are also expected to be Jess 
well tolerated than changes outside this region. Residues that might be important to the 
C. kefyr CD are indicated in Fig. 6 as bold. These residues involve a change in charge as 
compared with the closest homolog and are in the binding region. Changes in these 
25 residues are expected to change function. Active variants are likely to be those that 
involve conservative changes in those residues not conserved in the 2 yeasts, particularly 
those outside the binding region. 
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Variants are expressed in E. eoli and screened for activity using the assay 
described in example 3, or any suitable assay. Alternatively, random mutagenesis is 
performed and the products are similarly screened. In this manner, it will be possible to 
isolate variants with improved temperature stability at 37°C and mutants with even better 
5 activity than that described herein. 

Naturally occurring variants are isolated by screening populations of Candida 
kefyr with the cDNA of SEQ ED NO: 1 or 3 at high stringency. Alternatively, natural 
alleles can be isolated by ASO {allele specific oligonucleotide) screening using an array 
of overlapping oligonucleotides that provide complete coverage of SEQ ID NO: 1 or 3. 
1 0 In yet a third alternative, mutants with higher activity -can be isolated by rescue screening 
yeast CD mutants grown on cytosine, as a source of pyrimidine. 

EXAMPLE 7: CD-FUR1 FUSION 
A fusion protein of the Candida kefyr CD gene and the uracil 
phosphoribosyltransferase genes (FUR1) is constructed as shown in SEQ ID NO: 10. A 
15 similar construct was made with the Saccharomyces cerevisiae CD <FCY1) (15). The 
FC Y 1 -FUR1 fusion encoded a Afunctional chimeric protein that efficiently catalyzed the 
direct conversion of 5FC into the toxic metabolites 5FU and 5-FdUMGP, thus bypassing 
the natural resistance of certain human tumor cells to 5FU. Unexpectedly, the cytosine 
deaminase activity of the fusion proteins was 100-fold higher than the wild type, 
20 resulting in greatly increased sensitivity to concentrations of 5FC (1000-foH increased 
sensitivity). Furthermore, the bystander effect was also more effective with the fusion 
protein than either FCY1 or FUR1 alone or in combination. Because the Candida kefyr 
gene is 74% identical to the S. cere\>isiae gene, it is expected to function similarly and 
experiments will be performed to confirm this. 
25 Another type of multimodality therapy -can be achieved with a replication- 

conditional herpes simplex virus 1 mutant, where the viral ribonucleotide reductase gene 
is disrupted by sequences encoding yeast cytosine deaminase. HSVlyCD-infected cells 
convert 5FC to 5FU without significantly reducing viral replication and oncolysis. 
HSVlyCD-infected cells are destroyed by viral replication, and uninfected cells are 
30 subjected to bystander killing from both progeny virion and extracellular diffusion of 
5FU. This has been shown to increase anti-tumor effect. XI 8). 

EXAMPLE 8: CD-MONOCLONAL ANTIBODY FUSION 
CD can be covalently attached to monoclonal antibodies, forming conjugates that 
bind to antigens on tumor cell surfaces and thus targeting the CD to a specific oell type. 
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This experiment has been performed with fee S. *cerevisiae CD and the combination was 
specific for the antibody target <5). A similar effect can be achieved by expressing a CD- 
Monoclonal antibody fusion protein. 

EXAMPLE 9: SUICIDE GENE THERAPY 
5 Preliminary suicide gene therapy in vivo results will be obtained using the nude 

mouse tumor model. The human colon cancer cell line HT29 will be grown in RPMI 
supplemented with 10% heat-inactivated calf serum, 2 mM glutamine, 100 IU/ml 
penicillin and 100 mg/ml streptomycin. Stable HT29 cell lines expressing either bacterial 
or various yeast CD genes (including the humanized C kefyr CD gene) will be generated 
10 by viral infection using the retroviral vector LZR (Lazarus), or a gene-viral vector based 
on the adenovirus (19). Cells will be reseeded 24 h after infection to allow the formation 
of single colonies, which will be isolated and tested for CD activity. CD-positive clones 
will be used to generate mice tumor models. 

Nude female mice {Nu/Nu CD-I, Charles River Laboratories, Wilmington, MA) 
15 of 7-8 weeks will receive injections <s.c.) in the flank with 5 x 10 6 viable HT29-CD 
cells, generated above. Tumors will be measured biweekly with calipers in 2 
dimensions. Tumor volumes will calculatedin mm 3 using the formula: (3.1 4/6) <£ X W\ 
When tumors are >50 mm 3 and measure an average volume of 100-150 mm 3 , treatment 
will be started. Mice will receive injections daily <i.p.) with 500 mgfleg 5FC or 25 mg/kg 
20 5FU 5 days a week for 2 weeks. Differences in the efficacy between treatments will be 
measured. 

Suicide gene therapy has already been tested in several phase I and phase H 
clinical trials, and both safety and moderate efficacy have been shown. However, there 
is room for improvement of both transfection efficiencies and gene expression. It is 

25 anticipated that the use of a gene that codes for a more active 5FC to 5FU converting 
enzyme will provide benefit in suicide gene therapy, allowing the use of lower doses of 
5FC. Clinical experiments with the new C. kefyr CD gene will not be undertaken for 
some time. In the interim, work is underway to optimize the reaction conditions for the 
C. kefyr CD protein, to confirm its cytotoxicity in cell toxicity assays, and in the nude 

30 mouse xenographic tumor model described above. It is anticipated that the C. kefyr CD 
gene will provide a benefit over the established bacterial gene, and may also prove to be 
an improvement over the S. cerevisiae gene. 
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EXAMPLE 11: TUMOR RESPONSE MONITORING 
The C. kefyr CD gene or its variants can be used to test an individual tumor cell's 
response to suicide gene therapy in vitro. Candidate tumor cells are transfected as above, 
and their responsiveness to the therapy assayed either in vitro or in me mouse tumor 
5 model. The tumor cells can be established tumor cell lines, or tumor cells biopsied from 
an individual. In this way, tumors most likely to benefit from suicide gene therapy using 
the CK gene can be identified. 

CITES: All citations are hereby expressly incorporated by reference and are 
relisted here for convenience: 
10 1. Fertil B,et al., Radiat. Res. (1984) 99: 83-93. 

2. Di Vito M, et al, Antimicrob. Agents Chemotlter. (19S6) 29(2):303-8. 

3. Lawrence TS. Cancer Res. ' (1988) 48: 725-730. 

4. YhihcxBE, etal.,Proc Natl Acad Sci USA. i\99\) 88:8039-8043. 

5. SenterPD, etal, Bioconjug. C/iem. (1991) 2(6):447-51. 
15 6. Freeman SM, etal, Cancer Res. (1993) 53:5274-5283. 

7. Higgins DG, et al, Nucleic Acids Res. (1994) 22:4673-80. 
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30 What is claimed is: 
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11. An isolated nucleic acid, comprising a sequence that-encodes a C. kejyr cytosine 

2 deaminase of SEQ ID NO: 2. 

1 2. The nucleic acid of claim 1 , wherein the nucleic acid is cDNA. 

1 3. The nucleic acid of claim 2, wherein the nucleic acid comprises SEQ ID NO: 1 

2 (wild type CD cDNA). 

1 4. The nucleic acid of claim 2, wherein the nucleic acid comprises SEQ ID NO: 7 

2 (E. coli biased CD cDNA). 

1 5. The nucleic acid of claim 2 9 wherein the nucleic acid comprises SEQ ID NO: 8 

2 (humanized CD cDNA). 

1 6. The nucleic acid of claim 2, wherein the nucleic acid comprises SEQ ID NO: 9 

2 (humanized and CpG-free CD cDNA). 

1 7. The nucleic acid of claim 2, wherein the nucleic acid comprises SEQ ID NO: 1 0 

2 (CD- uracil phosphoribosyltransferase fusion). 

1 8. The nucleic acid of claim 2, wherein the nucleic acid is optimized for expression 

2 in human cells. 

1 9. The nucleic acid of claim 2, wherein the nucleic acid is optimized for expression 

2 in bacterial cells. 

1 1 0. The nucleic acid of claim 2, wherein the nucleic acid is operably fused to a 

2 nucleic acid encoding uracil phosphoribosyltransferase; and a promoter. 

1 11. An isolated nucleic acid, comprising the sequence of the insert at ATGC PTA- 

2 4867. 

1 12. An isolated nucleic acid, encoding a protein comprising the sequence of fee insert 

2 atATCCPTA-4867. 

1 13. An isolated nucleic acid, encoding a protein consisting of the sequence of the 

2 insert at ATCC PTA-4867. 



1 14. 



A vector comprising the nucleic acid of any of claims 1-13 operably linkedto ; 
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2 promoter. 

1 15. The vector of claim 14 wherein the operably linked promoter allows tissue 

2 specific expression in human tissues. 

1 16. The vector of claim 14 wherein the operably linked promoter allows preferential 

2 expression in human cancer tissues. 

1 17. An isolated nucleic acid that encodes an antigenic fragment of C. kefyr cytosine 

2 deaminase of SEQ ID NO: 2, wherein the fragment comprises a sequence that is at least 

3 17 amino acids in length and less than 152 amino acids in length. 

1 18. The nucleic acid of claim 1 7, wherein the fragment is unique to C. kefyr cytosine 

2 deaminase. 

1 19. The nucleic acid of claim 1 7, wherein the fragment is -conserved in at least one 

2 other fungal cytosine deaminase. 

1 20. An isolated nucleic acid that encodes a variant of C. kefyr cytosine deaminase as 

2 described in table 2. 

1 21 . An isolated nucleic acid that encodes a naturally occurring variant of C. kefyr 

2 cytosine deaminase with at least 94% identity to SEQ ID NO: 2 over the entire length of 

3 SEQ ID NO: 2. 

1 22. An isolated nucleic acid comprising a sequence that encodes a protein with at 

2 least 94% identity over the entire length of SEQ ID NO: 2. 

1 23 . An isolated protein, comprising a sequence with at least 94% identity over the 

2 entire length of SEQ ID NO: 2. 

1 24. The protein of claim 23, wherein the sequence has at least 95% identity. 
1 25. The protein of claim 23, wherein the sequence has at least 9S% identity. 
1 26. The protein of claim 23, comprising SEQ ID NO: 2. 
1 27. The protein of claim 23, consisting of SEQ ID NO: 2. 

1 28. An isolated protein that comprises a variant of C. kefyr cytosine deaminase as 
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1 29. The protein of claims 23-28, which is substantially purified. 

1 30. The protein of claims 23-28, which is purified. 

1 31. An antibody that specifically binds to an antigenic fragment of SEQ ID NO:2. 

1 32. A method of making C kefyr cytosine deaminase protein, comprising growing a 

2 host cell comprising the vector of claim 14 in a nutrient medium and collecting the 

3 expressed C. kefyr cytosine deaminase protein. 

1 33. An isolated oligonucleotide, which is at least 22 nucleotides in length and which 

2 has a sequence of at least 22 consecutive nucleotides from SEQ ID NO: 1 . 

1 34. An isolated nucleic acid, which is at least 100 nucleotides in length and which 

2 has a sequence of at least 100 consecutive nucleotides from SEQ ID NO: i . 

1 35. An isolated oligonucleotide, which is at least 22 nucleotides in length and which 

2 has a sequence of at least 22 consecutive nucleotides from SEQ ID NO: 3. 

1 36. An isolated nucleic acid, which is at least 100 nucleotides in length and which 

2 has a sequence of at least 1 00 consecutive nucleotides from SEQ ID NO: 3. 
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